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What is claimed is : 

1. A document searching apparatus for 

searching a document group ynaving a link relation 
for a document, comprisinc 

a link importance /assigning unit weighting the 
link relation and a;ssigning link importance which 
indicates importance of the document based on the 
weighted link relation to the document; and 

an accessing unit accessing the document based 
on the linjc importance . 



a URL simila] 



2. The document searching^/apparatus as set 
forth in claim 1, 

wherein said link ^^portance assigning unit 
includes : 

:alculating unit calculating 
a URL similarly that is a similarity of URLs 
(Uniform R^ource Locators) that represent the 
document^ 

ierein said link importance assigning unit 
cal(2fulates the link importance based on the URL 
ramilarity and the link relation of the document. 



The document s;^rching apparatus as set 
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forth in claim 1, further comprifeing : 

a keyword extracting unfit extracting text 
parts from the documents and /extracting a keyword 
from the document contents. / 

4 - The document seardhing apparatus as set 
forth in claim 3, / 

wherein said keyword extracting unit 
calculates an occurrence frequency of the keyword 
in the document, and / 

wherein said keyword extracting unit further 
comprises: / 

a keyword - document correlation calculating 
unit calculating the cc/rrelation of the keyword and 
the document based ory the link importance and the 
occurrence frequency pf the keyword. 

5. The document searching apparatus as set 
forth in claim 4 , mart her comprising : 

a monitoring/ unit monitoring accesses from a 
user and generating an access log, and 

wherein saifd keyword - document correlation 
calculating uniyc calculates the correlation based 
on the keywo:yd occurrence frequency, the link 
importance, and the access log. 
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6- The document searching / apparatus as set 
forth in claim 4, further comprisying : 

a document type determining unit determining a 
document type of the documenrt based on the URL 
similarity, the number of links from the document, 
and the number of links to tlye document, 

wherein said keyword /- document correlation 
calculating unit selects the document based on the 
document type and calcuLates the correlation for 
the selected document. / 

7. The document / searching apparatus as set 
forth in one of claimsA, further comprising: 

an index creating unit creating an index for 

accessing the yaocument corresponding to 

pronunciation characters or spelling of the 
extracted keyword. / 

8 . The document searching apparatus as set 
forth in claim 1 , further comprising : 

a selectir^ unit allowing the user to select a 

portion of the pronunciation characters or spelling 
of the keyword, 

wherein/ said index creating unit places less 
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than a predetermined number of/ documents highly 
correlated with the keyword in /the index based on 
the correlation calculated jpy said keyword 
document correlation calculati/g unit, and 

wherein said accessing unit accesses the 
document based on the selected keyword. 

9. The document searching apparatus as set 
forth in one of claims 1, /further comprising: 

a collecting unit dollecting the document from 
a network. / 

10. The document searching apparatus as set 
forth in claim 1, / 

wherein said /link importance assigning unit 
causes the weight /of the link relation between the 
documents with / a high URL similarity to be 
decreased. / 

11. The/ document searching, apparatus as set 
forth in cla/m 1, 

wherein said link importance assigning unit 
causes the document that is linked from important 
document/ and whose URL similarity is low to be 
importarit- 
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12. The document searchd/ng apparatus as set 
forth in claim 1, / 

wherein said link importance assigning unit 
causes the importance of document linked from many 
document whose URL similarity are high to be 
decreased. / 

13. The document searching apparatus as set 
forth in claim 1, / 

wherein the link importance of each documant 
is defined as a soliation of the following 
simultaneous linear equ^ion (1), assuming that Cq 
is constant (the lower limit of the importance that 
depends on each page) for each p ^ DOC and that when 
a page p is linked to /a page q, the link weight Iw 
(p, q) is defined by the formula (2): 

Wq=Cq+ ^Wp*lwUp,q) ... (1) 

/jeRefed(q) / 

lw(p,q) = diff(p,q)/ hdiffipj) = 

^Refip) sim(p,g) y 

/ iM)Sim(p,i) 

/ " ... (2) 

where DOC = / {pi, p2, pN} is a set of 

documents calculaiced for the link importance; Wp is 
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the link importance of the page p; Ref yp) is a set 
of pages linked from the page p; Refefia(p) is a set 
of pages linking to the page p; sig?i(p, q) is the 
URL similarity of the pages p and f\; dif f (p, q) = 
l/sim(p, q). is the difference. 



lifts.*- 



10 



14. The document searchiry^ apparatus as set 
forth in one of claims 1 , 
\',^ wherein the URL similarity is determined based 
on characters of a URL conta:yning a, server address 



1= a 



15. A document index creating apparatus for 
creating an index of a d)6cument group having a link 
relation, comprising : 
15 a link importance assigning unit assigning a 

link importance to lyhe document based on the link 
relation; 

a keyword extracting unit extracting a keyword 
from the document/ 
20 an index creating unit creating an index for 

accessing the/ keyword based on pronunciation 
characters or/ spelling of the extracted keyword; 
and 

an acc^fessing unit accessing document assigned 
25 the link Importance corresponding to the keyword 



V 
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when the pronunciatLem characters or spelling of 
the keyword are se^^ected from the index. 

16. The document index cp^ting apparatus as 
set forth in claim 15, 

wherein said link ^.iTmportance assigning unit 
includes : 

a URL similari/6^ s^lculating unit calculating 
a URL similarity tliat is a similarity of URLs 
(Uniform Resource Locators) that represent the 
location of/ the documents in a netowork, 

whenein said link importance assigning unit 
calculaxes the link importance based on the URL 
simiX^Jtrity and the link relation of the document. 



17. A document index creating apparatus for 
creating an index of a documei;n: group having a link 
relation, comprising : 

a link importance assigning unit assigning a 
link importance to th^ document depending on 
whether or not URLs of yche documents are similar; 

a keyword extraoxing unit extracting a keyword 
from the document; ^and 

an index cr^^ating unit creating an index for 
accessing th;e document corresponding to 
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pronunciation characters or spellLng of the 
extracted keyword based on the link iinportance. 



18. A link list creating system for creating a 
link list for a document group having a link 
relation, comprising ; 

a collecting unit coll^fcting the documents 
from a network; 

a link importance assigning unit assigning a 
link importance as of the document an importance 
calculated based on tl^e link relation to the 
document ; 

a URL character string determining unit 
determining a y'RL having a particular 
characteristic of / a character string from the 
documents ; and 

an index cheating unit creating a link list 
for listing less than a predetermined number of 
links to the documents based on the link importance 
and the particular characteristic of the character 



ina tne parti' 
jtring of th^B 



URL. 



19. tr^/ The link list creating system as set 
forth in^^flWrn 18, further comprising: 

a ^document type determining unit determining a 
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document type of the document /based on a URL 

similarity reperesenting the siinilarity of between 

URLs of the documents, ther\nijmber of links to the 

document, and the number of^-^ks from the document, 

wherein said index c/e^ting unit selects the 

document based on the dooument type and creates the 

link list of the selected document. 

/ 

20. A document/searching method for searching 
a document group /having a link relation for a 
document , compris^g : 

assigning ^ link importance as an importance 
of the documeryfc calculated with weighting the link 
relation to t;^e document; and 

access/ng the document based on the link 
importance 



20 
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21. The document searchinq/method as set forth 
in claim 20, further compri^ng: 

calculating a / ^iP^ similarity that is a 



similarity of URLs 
represent the 



ni^^form Resource Locators) that 
nt; and 



cume 

calcul^ing the link importance based on the 
URL sirjKLlarity and the link relation of the 
docuipent 



22. The document /searching method as set forth 
in claim 20, further aomprising: 

extracting a keyword from the document . 

23. The document searching method as set forth 
in claim 20, further comprising: 

calculating an occurrence frequency of the 
keyword in the document , and 

calculating /bhe correlation of the keyword and 
the document based on the link importance and the 
occurrence f requJency of the keyword . 

24. The document searching method as set forth 
in claim 23, farther comprising: 

monitoring accesses from a user and generating 
an access log/ and 

calculates the correlation based on the 
keyword occuirrence frequency, the link importance, 
and the acce/ss log. 



25. The document searching method as set forth 
in claim 23, further comprising: 

deterrjiining a document type of the document 
based on tpe URL similarity, the number of links to 



the document, and the nunper of links from the 
document; and 

selecting the document based on the document 
type and calculating jfche correlation of the 
selected document . 

26. The document sjbarching method as set forth 
in one of claims 22, /further comprising the step 
of: 

creating an indei for accessing the document 
corresponding to nronunciation characters or 
spelling of the extracted keyword. 

27. The document searching method as set forth 
in claim 26, further? comprising the steps of: 

placing less than a predetermined number of 
documents which ar^ correlated with the keyword in 
the index; j 

selecting a portion of the pronunciation 
characters or spelling of the keyword; and 

accessing tine document corresponding to the 
selected portion/ of the pronunciation characters or 
spelling of the /selected keyword. 

28. The document searching method as set forth 



in one of claims 20, further comprisin;^ the step 
of: 

collecting the document from a ne^Work. 

29, A link list creating methoc/^for creating a 

link list for a document grouf)/ having a link 

relation, comprising the steps of:/ 

/ 

colleting the document from^/a network; 

assigning a link importance which indicates 
inmortance of the document to^the document based on 
the link relation; / 

determining a URL ^^having a particular 
characteristic of a charac/er string from the URLs 
of each document; and / 

creating a link li^t for listing less than a 

predetermined number of/ links to the document based 

on the link impedance and the particular 

characteristic of the'^ character string of the URL. 

/ 
/ 
/ 
/ 

30. The link/list creating method as set forth 

in claim 29, furt/her comprising the steps of: 

/ 

determining a document type of the document 

based on the similarity, the number of links to 

the documentyv and the number of links from the 
/ 

document; ar/d 
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selecting the document based on the docfument 
type and creating the link list for the /elected 
document based on the link importance/ and the 
particular characteristic of the character string 
of the URL. 




31. A computer readable record medium for 

recording a program that causes a computer to 

/ 
/ 

execute a process for creatir^^ a link list for a 
document group having a Ixn)/ relation, the program 
comprising the steps of: 

colleting document^^ f rom a network; 

assigning a linly^ importance which indicates 
inmortance of the dc/cument to each document based 
on the link relatioi^; 

determining / a URL having a particular 

characteristic 0f a character string from the URLs 

of documents ; ,^nd 
/ 

creating a link list for listing less than a 
predetermined number of links to the documents 
based on/ the link importance and the particular 
characteristic of the character string of the URL. 



